Structural feature-driven pattern analysis for multitarget modulator landscapes

Abstract Motivation Multitargeting features of small molecules have been of increasing interest in recent years. Polypharmacological drugs that address several therapeutic targets may provide greater therapeutic benefits for patients. Furthermore, multitarget compounds can be used to address proteins of the same (or similar) protein families for their exploration as potential pharmacological targets. In addition, the knowledge of multitargeting features is of major importance in the drug selection process; particularly in ultra-large virtual screening procedures to gain high-quality compound collections. However, large-scale multitarget modulator landscapes are almost non-existent. Results We implemented a specific feature-driven computer-aided pattern analysis (C@PA) to extract molecular-structural features of inhibitors of the model protein family of ATP-binding cassette (ABC) transporters. New molecular-structural features have been identified that successfully expanded the known multitarget modulator landscape of pan-ABC transporter inhibitors. The prediction capability was biologically confirmed by the successful discovery of pan-ABC transporter inhibitors with a distinct inhibitory activity profile. Availability and implementation The multitarget dataset is available on the PANABC web page (http://www.panabc.info) and its use is free of charge. Supplementary information Supplementary data are available at Bioinformatics online.


Introduction
In silico virtual screening approaches to identify bioactive compounds have evolved over the last few years and became fundamental for novel drug discovery. Chemical space has exponentially increased, revealing virtually endless possibilities in molecularstructural variation and composition of potentially bioactive compounds. For example, the Enamine REAL SPACE V R virtual compound library (https://enamine.net) consists, to this date, of over 21 billion compounds that may be obtained by organic (retro-)synthesis. Virtual screenings of this large chemical space have become not only an opportunity but also a challenge. Countless screening tools have been developed to extract putatively bioactive compounds from virtual compound libraries, but lack nowadays the necessary discriminatory potential to downsize large datasets of virtual compounds to gain high-quality compound collections.
'Multitarget datasets' may hold the key for sufficient discrimination of potential drug candidates. Here, the biological information of a set of known modulators of a pharmacological target is complemented with the biological information of (an)other (related or nonrelated) pharmacological target(s). Depending on the aimed pharmacological profile, e.g. selectivity or promiscuity, discriminators may be developed for further compound extraction. Unfortunately, such multitarget datasets are barely available (Namasivayam et al., 2021a). We have recently reported a multitarget dataset of focused pan-ATP-binding cassette (ABC) transporter inhibitors (Namasivayam et al., 2021c) that target the well-studied ABC transporters ABCB1, ABCC1 and ABCG2 (Namasivayam et al., 2021a). These particular transport proteins constitute a model biological system for multitargeting for three reasons: (i) all three members have extensively been studied and hundreds of compounds exists that directly interact with each of the transporters; (ii) a large number of compounds exists that have been evaluated on all three targets; and (iii) a sufficient number of compounds exists with overlap in affinity regarding two or all three transporters, enabling to focus on multitargeting.
In-depth analysis of the molecular-structural patterns of the multitarget dataset resulted in the discovery of substructures with positive and negative impact on multitarget inhibition (Namasivayam et al., 2021a,b). However, these extracted substructures account only for a part of the multitarget modulator landscape and do not provide sufficient discriminatory potential to downsize ultra-large virtual compound libraries.
This becomes evident considering that the Enamine drug-like V R virtual compound library, which was used in the aforementioned study (Namasivayam et al., 2021a), revealed 1505 putatively bioactive candidates out of an initial virtual library dataset of around 16 million compounds. Comparing this initial virtual library dataset with the nowadays available ultra-large virtual library dataset of 21 billion compounds (https://enamine.net), a similar screening of the latter dataset would lead to hundreds of thousands of potential drug candidates from which manual selection is necessary. Manual selections are important to account for the empirical experience of researchers. However, mental processing of tens of thousands of compounds with consistent criteria is impossible.
In this work, we present a novel molecular-structural feature-driven pattern analysis (C@PA) workflow to explore the edges of the multitarget modulator landscape. As the prediction of highly active compounds (IC 50 < 10 mM) in the 'Inner Multitarget Modulator Landscape' has already been achieved previously (Namasivayam et al., 2021a), this study focused the discovery of critical substructures in the 'Outer Multitarget Modulator Landscape' (IC 50 ¼ 10-200 mM). Given the fact that a multitarget dataset of the well-studied ABC transporters ABCB1, ABCC1 and ABCG2 exists and the functional activity of these transporters can easily be assessed in well-established assays, they represent a model biological system for the intended studies. The landscape was used for the prediction of bioactive compounds with distinct bioactivity range and was biologically validated.

Computational analyses
Compilation of dataset. The multitarget dataset comprising 1049 manually assembled and curated compounds as reported in the literature (Namasivayam et al., 2021a) has been updated by adding 111 compounds that were recently evaluated on the targets ABCB1, ABCC1 and ABCG2 (Antoni et al., 2021;Namasivayam et al., 2021b;Telbisz et al., 2021;Vagiannis et al., 2020). From this dataset of in total 1160 compounds, 138 compounds could be extracted that target all three focused targets, ABCB1, ABCC1 and ABCG2, which are termed in the following as 'focused pan-ABC transporter inhibitors' (Namasivayam et al., 2021c), with IC 50 values < 200 mM. In addition, a list of 304 compounds could be extracted that had either no activity (> 200 mM) or were only active against one or two transporters with IC 50 values ! 10-200 mM. These compounds were termed in our previous report as 'Class 0' compounds (Namasivayam et al., 2021a).
A scaffold search was conducted using the Structure-Activity-Report (SAReport) tool (Clark and Labute, 2009) implemented in Molecular Operating Environment (MOE; version 2019.01). In total, 103 relevant substructures were extracted from the literature (Namasivayam et al., 2021a,b) as well as the present analyses. A binary distribution scheme of these substructures was established using InstantJChem (version 20.15.0). The percentage distribution of each substructure amongst the focused compounds was calculated according to formula (1).
The ratios between each group of compounds were calculated. Supplementary Table S1 provides the names and SMILES of the used compounds and substructures.
Virtual Screening. In a recent report, we generated an ABCC1focussing dataset comprising 1510 compounds which turned out to be an optimal virtual screening dataset due to its known multitarget hit rate of 23.5% and its non-substructure-based selection criteria (Silbermann et al., 2019). It has already been established as virtual screening dataset (Namasivayam et al., 2021b) and was used in this study. A virtual screening protocol was implemented applying defined conditions using InstantJChem (version 20.15.0). A manual selection of the resultant potential hit compounds provided compounds 1-5 (Supplementary Table S2) which were purchased from MolPort V R (http://www.molport.com).
All cell lines were stored in liquid nitrogen in a cell culture media/DMSO mixture (90%/10%). Cultivation was accomplished at 37 C under 5% CO 2 -humidified atmosphere. At a confluence of 90% cells were harvested using a trypsin-EDTA solution (0.05%/ 0.02%; PAN-Biotech GmbH, Aidenbach, Germany and Biowest, Nuaillé, France). The cells were centrifuged and the supernatant was removed, followed by resuspension of the cells in fresh cell culture media and cell counting using either a CASY TT (150 mm capillary; Schä rfe System GmbH, Reutlingen, Germany) or a Scepter handheld automated cell counter (60 mM capillary sensor; EMD Millipore, Billerica, MA, USA). Subsequent seeding of cells for either subculturing or biological testing followed.
Calcein AM assay. The calcein AM assay was conducted as reported earlier (Namasivayam et al., 2021a,b;Silbermann et al., 2019;Stefan et al., 2017). For the initial screening, 20 ml of compounds 1-5 (100 mM) were added into 96-well flat-bottom clear plates (Greiner, Frickenhausen, Germany) followed by addition of 160 ml of cell suspension containing A2780/ADR (30 000 cells/well) or H69AR (60 000 cells/well) cells and incubation at 37 C and 5% CO 2 -humidified atmosphere for 30 min. Calcein AM (20 ml/3.125 mM) was added to each well and the fluorescence increase (excitation: 485 nm; emission: 520 nm) was measured in 60 sec intervals for 1 h in POLARstar and/or FLUOstar Optima microplate readers (BMG Labtech, Offenburg, Germany) and compared to the reference inhibitors 6 (ABCB1) and 7 (ABCC1). Compounds that resulted in a significantly different fluorescence increase compared with no inhibition (pure cell culture media; bottom value) were evaluated in concentration-effect curves (1-10 mM final concentration). Data processing including determination of IC 50 values was accomplished using GraphPad Prism (version 8.4.0; San Diego, CA, USA), and the resultant concentration-effect curves have been constrained to the maximal effect of standard inhibitors 6 (ABCB1) and 7 (ABCC1).
Pheophorbide A assay. The pheophorbide A assay was conducted as reported earlier (Namasivayam et al., 2021a,b;Silbermann et al., 2019). Compounds 1-5 (100 mM) were added into 96-well flat-bottom clear plates followed by addition of 160 ml of cell suspension containing MDCK II BCRP (45 000 cells/well) and 20 ml of a pheophorbide A solution (5 mM) with subsequent incubation at 37 C and 5% CO 2 -humidified atmosphere for 120 min. Flow cytometry (Guava easyCyte TM HT; Merck Millipore, Billerica, MA, USA; excitation: 488 nm; emission: 695/50 nm) was applied, and the average fluorescence value per cell per well was correlated to the used compound concentration of compounds 1-5 as well as reference inhibitor 8. Compounds that resulted in a significantly different fluorescence increase compared to no inhibition (pure cell culture media; bottom value) were evaluated in concentrationeffect curves (1-10 mM final concentration). Data processing was conducted as stated above, and the resultant concentration-effect curves have been constrained to the maximal effect of standard inhibitor 8.
Daunorubicin assay. For the verification of determined IC 50 values against ABCB1 and ABCC1 in the calcein AM assay and to exclude unspecific effects, the daunorubicin accumulation assay was performed as described earlier (Jekerle et al., 2006;Namasivayam et al., 2021a). Dilution series of hit compounds 3 and 5 between concentrations of 5-300 mM have been generated and 20 ll of the compounds were added into clear 96-well flat-bottom plates (Greiner, Frickenhausen, Germany) and complemented with 160 ll of a cell suspension containing 45 000 cells of either A2780/ADR, A2780, H69AR or H69 cell lines in colorless RPMI-1640 without supplements. Twenty microliters of a daunorubicin solution (30 mM) were added and the plate was incubated for 180 min at 37 C and a 5% CO 2 humidified atmosphere. Fluorescence was measured with an Attune NxT flow cytometer (Invitrogen, Waltham, MA, USA) at an excitation wavelength of 488 nm and an emission wavelength of 695/50 nm. Data analysis was performed as described before, and the resultant concentration-effect curves have been constrained to the maximal effect of standard inhibitors 6 (ABCB1) and 7 (ABCC1).
Hoechst 33342 assay. To verify the inhibition against ABCG2 and to exclude unspecific effects of compounds 3 and 5 obtained in flow cytometry, we performed a plate reader-based Hoechst 33342 assay as reported earlier (Namasivayam et al., 2021a). Dilution series of test compounds 3 and 5 between concentrations of 5-300 mM have been generated and 20 ll of the test compounds were added into black 96-well flat-bottom plates (Brand, Wertheim, Germany) and complemented with 160 ll of a cell suspension containing 30 000 cells of either MDCK II BCRP or MDCK II cells in colorless DMEM without supplements. After an incubation period of 30 min at 37 C and 5% CO 2 , 20 ml of a Hoechst 33342 solution (10 lM) were added. The fluorescence accumulation at an excitation wavelength of 360 nm and an emission wavelength of 460 nm was measured over a period of 120 min in 60 s intervals using a Paradigm microplate reader [Beckman Coulter (Brea, CA, USA)]. The average steady-state fluorescence value was calculated for each well and correlated to the used compound concentration. Data analysis was performed as described before, and the resultant concentrationeffect curves have been constrained to the maximal effect of standard inhibitor 8.
Fluorimetric analyses. Compounds 3 and 5 (5-30 mM final concentration) were subject to fluorescence measurements for the determination of potential auto-fluorescence. Excitation wave-lengths have been analyzed that were used in the described assays above [360 (Hoechst 33342) and 485 (calcein AM, daunoruicin, pheophorbide A)] by acquiring complete spectra covering a wavelength range between 400-700 and 525-700 nm, respectively, using a Paradigm microplate reader [Beckman Coulter (Brea, CA, USA)]. Neither compound 3 nor compound 5 showed auto-fluorescence at concentrations up to 30 mM (data not shown).

Classification of compounds
As specified in Section 2.1, 138 focused pan-ABC transporter inhibitors were extracted from the multitarget dataset (Antoni et al., 2021;Namasivayam et al., 2021a,b;Telbisz et al., 2021;Vagiannis et al., 2020;Fig. 1A). These compounds were classified according to their activity profile (from very strong to very weak inhibition; Fig. 1B). In our earlier report, compounds with IC 50 values < 10 mM were designated as 'Class 7' compounds (Namasivayam et al., 2021a). In the 138 focused pan-ABC transporter inhibitors, 56 Class 7 compounds exist. However, to acknowledge the different inhibition values and to allow for a detailed analysis, we sub-categorized these compounds into 'Superior Class 7' (IC 50 values against all targets < 5 mM; 22 of 138 compounds) and 'Medium Class 7' compounds (all IC 50 values < 10 mM and at least one IC 50 value between 5 and < 10 mM; 36 of 138 compounds). In addition, we established 'Semi Class 7' compounds in our previous work that had significant potency against all three selected targets, ABCB1, ABCC1 and ABCG2, however, did not qualify for class 7 compounds (Namasivayam et al., 2021b): all IC 50 values 15 mM and at least one IC 50 value 10-15 mM; 22 of 138 compounds. The compound classification was completed by defining weak and very weak focused pan-ABC transporter inhibitors. 'Weak Pan-ABC' transporter inhibitors were defined by IC 50 values against two transporters ! 20 mM and against one transporter < 10 mM, which accounted for 8 of 138 compounds). 'Very Weak Pan-ABC' transporter inhibitors were defined by IC 50 values against two transporters ! 20 mM and against one transporter ! 10 mM; 6 of 138 compounds). In summary, 92 (22 þ 34 þ 22 þ 8 þ 6) of the 138 focused pan-ABC transporter inhibitors could be categorized according to the described scheme, which will be termed in the following as 'classified focused pan-ABC transporter inhibitors'. On the other hand, 46 of 138 compounds could not be taken into account in the following analyses and were termed as 'unclassified'.
In addition to the collection of pan-ABC transporter inhibitors, 304 compounds could be extracted from the multitarget dataset that had either (almost) no activity toward the transporters ABCB1, ABCC1 and ABCG2 (IC 50 > 200 mM; 118 of 304 compounds) or affected maximally 2 transporters with activity values between 10-200 mM (186 of 304 compounds; Antoni et al., 2021;Namasivayam et al., 2021b;Telbisz et al., 2021;Vagiannis et al., 2020;Fig. 1C). These compounds are referred to in the following as 'Class 0' compounds according to our previous report (Namasivayam et al., 2021a). However, to enable for a detailed analysis of this particular set of compounds, the 118 compounds without any activity were designated as 'Real Class 0' compounds, while the 186 compounds were named 'Semi Class 0' compounds. Figure 1D summarizes the criteria of compound classification for subsequent statistical pattern analysis.

Compilation of relevant substructures
Our previous reports have revealed several critical substructures in terms of multitarget inhibition of the transporters ABCB1, ABCC1 and ABCG2, which can generally be distributed into two different groups: (i) the 'positive multitarget substructures' that promote pan-ABC transporter inhibition; and (ii) the 'negative multitarget substructures' which impedes pan-ABC transporter inhibition (Namasivayam et al., 2021a,b). Taking our previous literature reports into account, the positive multitarget substructures consist of (i) 6 'primary basic scaffolds', (ii) 2 'suggested basic scaffolds', (iii) 7 'primary positive substructures'; (iv) 7 'suggested positive substructures', (vi) 15 'extended positive substructures' and (vii) 9 'putative positive substructures'. On the other hand, the negative multitarget substructures comprise 30 'primary negative substructures'. A detailed survey of these substructures and their allocated group can be found in Supplementary Table S3.
In our most recent work, we suggested putative positive substructures that were defined as partial structures which were present in at least 10 Class 7 compounds (Namasivayam et al., 2021b). However, no analogous concept has been developed for negative multitarget substructures. Hence, we re-analyzed the multitarget dataset (Namasivayam et al., 2021a) toward 'putative negative substructures', and identified 24 additional substructures that complement the negative multitarget substructures according to a rule scheme as outlined in Supplementary Information S1.
In total, 103 substructures have been compiled from the literature (Namasivayam et al., 2021a,b) and this study (Supplementary  Table S3), which were used for subsequent analyses.

Individual pattern analysis and substructural re-grouping
To obtain a detailed understanding of the individual contribution of each of the 103 substructures to multitarget inhibition of the ABC transporters ABCB1, ABCC1 and ABCG2 and to identify important substructures in terms of the multitarget modulator landscape, an individual evaluation of the statistical distribution of these 103 substructures amongst the 92 classified focused pan-ABC transporter inhibitors of ABCB1, ABCC1 and ABCG2, as well as the 304 Class 0 compounds for each substructure has been conducted. This 'individual pattern analysis' followed a rule scheme as outlined in Supplementary Information S2, and the individual substructures were evaluated and re-grouped according to a point score (þþþ; þþ; þ; 0) depending on the (non-)violation of the rules as outlined in Supplementary Information S3.
Considering the point scores, the following groups were defined in which the 103 analyzed substructures could be re-distributed:

Virtual screening for the outer multitarget modulator landscape
To support the presented discoveries, we implemented a virtual screening protocol specifically to discover compounds of the Outer Multitarget Modulator Landscape. For this purpose, the already established virtual screening dataset of 1510 compounds was applied as described before (Namasivayam et al., 2021b;Silbermann et al., 2019). This particular dataset has a known multitarget hit rate of 23.5% and was not constrained to C@PA-derived inclusion-and/or exclusion criteria.

Biological validation of the outer multitarget modulator landscape
Calcein AM (ABCB1 and ABCC1) and pheophorbide A (ABCG2) fluorescence accumulation assays were used to evaluate the inhibitory feature of compounds 1-5. For this purpose, ABCB1overexpressing A2780/ADR, ABCC1-overexpressing H69AR, and ABCG2-overexpressing MDCK II BCRP were used as reported earlier (Namasivayam et al., 2021a,b;Silbermann et al., 2019;Stefan et al., 2017). Figure 5 shows the results of the initial screening of the compounds at 10 mM, and Table 1 reveals the inhibitory activity data of the compounds that resulted in a significant fluorescence increase compared to no inhibition (pure cell culture media; bottom value). The obtained IC 50 values have been compared to very weak pan-ABC transporter inhibitors known in the literature (6 compounds; Colabufo et al., 2009;Lempers et al., 2016;Obreque-Balboa et al., 2016).
Compounds 1-5 had at least inhibitory activities against two targets, and hence, revealed themselves as multitarget inhibitors. This equals a multitarget hit rate of 100%. This multitarget hit rate exceeded the determined multitarget hit rates of our previous studies [70.0% (Namasivayam et al., 2021b) and 60.9% (Namasivayam et al., 2021a)]. Two candidates, 3 and 5, targeted ABCB1, ABCC1 and ABCG2, which represents a hit rate of 40%. This hit rate is comparable to the hit rate of our previous study (Namasivayam et al., 2021b), and higher than similar virtual screening approaches [21.7% (Namasivayam et al., 2021a) and 23.5% (Silbermann et al., 2019)]. The standout findings are the activity ranges of these  Landscape. In addition, the measure to include one tolerated negative multitarget substructure to acknowledge certain bioactivities of Semi Class 0 compounds proved to be correct. Tetrazole, a substructure mostly occurring in real class 0 compounds and rarely present in Semi Class 0 compounds, can now be considered as an 'active substructure' in terms of multitargeting of ABC transporters.
To confirm our results and to exclude unspecific effects or assayrelated artifacts in terms of the hit compounds 3 and 5, we determined inhibition values in alternative assays. For this purpose, the flow cytometry-based dauorubicin accumulation assay (ABCB1 and ABCC1) as well as the plate reader-based Hoechst 33342 assay have been chosen. In addition, we investigated compounds 3 and 5 at various concentrations in sensitive/wild-type cell lines (A2780, H69 and MDCK II). In essence, the inhibitory power against the targets ABCB1, ABCC1 and ABCG2 could be confirmed in all three cases (Table 1), and no unspecific effects were identified after taking the data regarding the sensitive/wild-type cell lines into account. Supplementary Figure S1 demonstrates the concentration-effect curves generated for the most potent representative of the newly derived very weak pan-ABC transporter inhibitors, compound 5.

Conclusions
At first glance, the intended search and prediction of weak multitargeting compounds seems counterintuitive. In the past, virtual screening approaches focused on the identification of potential drug candidates with high potency toward their target(s) (Namasivayam et al., 2021a,b;Silbermann et al., 2019). However, the goal of this study was to address the question of whether additional descriptional substructures could be identified and/or subsequently developed that define the multitarget modulator landscape in its entirety, and may be used to search for a specific activity range of bioactive compounds. The multitarget modulator landscape is particularly of major interest due to five reasons.
First, with the advance of chemical space, new descriptors are needed to downsize ultra-large virtual compound libraries. Conventional screening methodologies with rather simple selection frameworks will not be sufficient in the future to efficiently reduce the absolute number of potential drug candidates. Indeed, countless screening tools have been identified in the last 2-3 decades, and the number of descriptors is almost endless. However, certain descriptors used as additional discriminators may not be directly connected to bioactivity and could lead to false selections or even to no discrimination in terms of the outgoing virtual screening dataset. This could lead to an insufficient reduction of potential drug candidates, and hence, an immense output of compounds that need to be processed manually. As outlined in the introduction, manual selections support the drug selection process through empirical experience of researchers. However, this process should be reserved for the final shortlisted compounds, not tens of thousands of compounds, to keep selection criteria consistent and prevent irrational or random selection. Rationalizing these decisions to the very end is the major obstacle in computational chemistry today.
In terms of the selection process, the Outer Multitarget Modulator Landscape in general can serve in two ways: (i) as an additional descriptor to target a specific bioactivity range and/or to support the extraction of the potentially best compounds for in vitro and subsequently in vivo experiments, as well as (pre-)clinical evaluation of a specific virtual screening approach; and (ii) to create highquality compound collections that serve multifactorial purposes. These compound collections are pre-selected compound sets with target-related determinants from which an actual screening for bioactivity takes place, for example, to establish new screening methodologies.
Very recently, C@PA was introduced as a novel screening methodology ultimately identifying substructures of compounds which were linked to bioactivity, and developing these substructures as descriptors for virtual screening approaches (Namasivayam et al., 2021a). Their subsequent extension proved how powerful substructures really are (Namasivayam et al., 2021b). Through the presented workflow, several substructures could be identified that outline and mark the very edges of the Outer Multitarget Modulator Landscape, which qualify as additional descriptors for pre-selecting ultra-large virtual screening compound libraries for the generation of highquality compound collections, and ultimately the final rational drug selection process. In addition, the presented individual statistical pattern analysis will promote fragment-based de novo computational drug design approaches.
Second, the interest in polypharmacology has risen in recent years (Bajorath, 2021;Blaschke and Bajorath, 2021;Feldmann and Bajorath, 2020;Stefan, 2019;Stefan et al., 2020a). The presented methodology proved to be superior in extracting multitargeting compounds, which may be transferred to other fields of pharmacology as well. However, another reason to target the Outer Multitarget Modulator Landscape could be the development of novel agents of targets that are under-studied and cannot be addressed yet by small molecules. These small molecules may be promising for the exploration and exploitation of novel pharmacological targets and the development of new diagnostics and therapeutics against human diseases. With the knowledge of the Outer Multitarget Modulator Landscape, new template structures can be derived, which may subsequently be improved by derivatization, initiating rational drug design. In the case of the model biological system of ABC transporters, this multitarget exploration concept has very recently been introduced, and (a) similar or at least overlapping   5. Investigation of compounds 1-5 (10 mM) against ABCB1 and ABCC1 (calcein AM assay), and ABCG2 (pheophorbide A assay). Normalization was conducted by defining the effect of 10 mM of compounds 6 (ABCB1), 7 (ABCC1) and 8 (ABCG2) as 100% and pure cell culture medium as 0%. The mean values 6 standard error of the mean (SEM) of at least three independent experiments are shown. Red mark: discovered Outer Multitarget Modulator Landscape-focused pan-ABC transporter inhibitors 3 and 5; a apparent ABCC1 activation (5.7% 6 2.7%); b apparent ABCC1 activation (7.0% 6 2.8%) common binding site(s) of pan-ABC transporter inhibitors across different ABC transporter subfamilies has been proposed (Namasivayam et al., 2021a,b,c;Stefan et al., 2020a). This 'multitarget binding site' may hold the key to address under-studied ABC transporters and target under-studied ABC transporters-associated diseases.
Third, drug development does not only focus on polypharmacology but also on selective compounds. However, to generate purely selective compounds, the knowledge of both the inner and outer bioactivity landscape of these compounds toward several (related or unrelated) targets is critical. The presented workflow allows for the identification of these critical substructures to design compounds with the intended pharmacological profile. The outer bioactivity landscape is particularly of interest to avoid potential side-effects in (pre-)clinical evaluations. In terms of interference with ABC transporters, this extended pharmacological profile is of specific importance. ABC transporters are mainly expressed in blood-tissue barriers, and hence, majorly influence the distribution, pharmacokinetic, and therefore, pharmacodynamic of drugs. The design of, or screening for, drug candidates that do not bear features of the Inner and Outer Multitarget Modulator Landscape of pan-ABC transporter inhibition is highly preferable in the drug design processes.
Fourth, computational chemistry approaches with respect to the exploration of the bioactivity landscapes of compound datasets support the exploration of the bioactivity space in the vastly growing chemical space. The great advantage of virtual screening approaches is the almost certain discovery of novel compounds with unique molecular-structural composition and orientation. This also leads to the discovery of novel mode of actions. For example, one compound of our previous study (Namasivayam et al., 2021b) and two of this study (compounds 1-2) revealed an apparent slight activation of ABCC1. Activators of this ABC transporter are indeed known Wiese and Stefan, 2019), however, only very few compounds had distinct molecular features apart from known intrinsic ABCC1 substrates. The discovery and exploration of novel substructures associated with different modes of modulation will allow for the exploration of multiple dimensions of bioactivity space, for example, functional (mode of modulation) and structural (mode of binding) aspects. In terms of ABC transporters, the active/intended search of ABC transporter activators is a current research field that would benefit from these developments.
Fifth, feature-driven C@PA ('C@PA_1.3') offers the opportunity to establish certain substructures with distinct pharmacological profile into a collective set of substructures for subsequent screening of virtual compound libraries. This may be of importance, for example, if different substructure sets targeting different (related or unrelated) targets, shall be combined to pursue a polypharmacological approach. Here, a 'collective pattern analysis' combining two different sets of substructures may reveal the positive or negative impact of (a) certain introduced substructure(s) on the bioactivity range of a training dataset. Also, the introduction of new or negative substructures (e.g. tetrazole) that are not associated with (a) certain target(s) may be accomplished with collective pattern analysis. This in effect shapes not only the future substructure composition in virtual screening approaches, but also the extracted compounds derived from these virtual screening approaches. These measures promote a steady re-consideration and re-evaluation of existing datasets, thriving novel and innovative drug development.
The virtual screening workflow of the future is suggested to comprise (i) an enlarged substructure dataset. The more substructures are searched for, the more discriminatory potential exists to downsize ultra-large virtual screening compound libraries to generate high-quality compound collections and to understand bioactivity landscapes as a whole; (ii) statistical pattern analysis according to the original C@PA (Namasivayam et al., 2021a), which proved to give an important framework for the ongoing substructural evaluation; (iii) scaffold fragmentation and substructure hopping with positive and negative multitarget substructures. The extension of the positive and negative multitarget substructures proved to be a significant discovery in terms of the exploration of pattern analysis in our previous (Namasivayam et al., 2021b) and this work; (iv) individual and collective pattern analysis of featured substructures. Specific substructures will be of higher importance than others. The current work presents a detailed scheme on how these important substructures may be elucidated and harnessed for better discriminatory potential; and (v) adjustment of virtual screening parameters Table 1. Inhibitory activity of compounds 1-5 against ABCB1, ABCC1 and ABCG2 expressed as IC 50 values derived from full-down concentration-effect curves determined in plate reader-based calcein AM (ABCB1 and ABCC1) and Hoechst 33342 (ABCG2) assays, as well as flow cytometer-based daunorubicin (ABCB1 and ABCC1) and pheophorbide A (ABCG2) assays as reported earlier (Jekerle et al., 2006;Namasivayam et al., 2021a,b;Sibermann et al., 2019;Stefan et al., 2017) compared to IC 50 values of very weak pan-ABC transporter inhibitors as reported earlier (Colabufo et al., 2009;Lempers et al., 2016;Obreque-Balboa et al., 2016) and subsequent virtual screening. In the final step, gained knowledge from steps (i) to (iv) must lead to a synthesized discrimination scheme that sufficiently downsized ultra-large virtual screening compound libraries.
Feature-driven pattern analysis for the elucidation of the multitarget modulator landscapes can and will be used for other (human or non-human) pharmacological targets as well, for example, other membrane bound proteins [e.g. under-studied human/bacterial ABC transporters, G-protein coupled receptors (GPCRs), ion channels (ICs), solute carriers (SLCs; PANSLC) or tyrosine kinases (TKs)]. The landscape is not only bound to the biological activity (e.g. inhibition). Other biological properties may also be subject to this methodology, for example, the mode of action (e.g. activation or partial inhibition), the binding(-site) behavior (e.g. competitive, noncompetitive, or uncompetitive), the target localization/drug effect venue [e.g. cell membrane or intra-vesicular systems (Stefan et al., 2020a,b)] or the addressed regulatory pathway [e.g. induction or downregulation (Pahnke et al., 2021;Stefan and Wiese, 2019;Wiese and Stefan, 2019)]-invigorating drug design, discovery and development in general.